Optimization for the Intel
نویسندگان
چکیده
The Intel R © Itanium R © architecture contains a number of innovative compiler-controllable features designed to exploit instruction level parallelism. New code generation and optimization techniques are critical to the application of these features to improve processor performance. For instance, the Itanium R © architecture provides a compilercontrollable virtual register stack to reduce the penalty of memory accesses associated with procedure calls. The Itanium R © Register Stack Engine (RSE) transparently manages the register stack and saves and restores physical registers to and from memory as needed. Existing code generation techniques for the register stack aggressively allocate virtual registers without regard to the register pressure on different control-flow paths. As such, applications with large data sets may stress the RSE, and cause substantial execution delays due to the high number of register saves and restores. Since the Itanium R © architecture is developed around Explicitly Parallel Instruction Computing (EPIC) concepts, solutions to increasing the register stack efficiency favor code generation techniques rather than hardware approaches.
منابع مشابه
Practical Game Performance Analysis Using Intel® Graphics Performance Analyzers
Improve the performance of your games running on Intel® HD Graphics (IHD) platforms using Intel® Graphics Performance Analyzers (Intel® GPA) with the performance analysis methods presented in this article. Intel GPA is a suite of graphics performance optimization tools that enables developers to visualize, isolate and resolve graphics performance issues for Microsoft* DirectX-based games and ot...
متن کاملUsing Intel® Tools to Aid Optimization
The software development cycle requires a code optimization phase once basic functionality has been established. Describing the effective execution of this phase using Intel® Software Development Tools is the objective of this document. It is intended to serve as an introductory guide for software developers and application engineers to optimally use their time in extracting the best performanc...
متن کاملCalculation of Stochastic Heating and Emissivity of Cosmic Dust Grains with Optimization for the Intel Many Integrated Core Architecture
Cosmic dust particles effectively attenuate starlight. Their absorption of starlight produces emission spectra from the nearto far-infrared, which depends on the sizes and properties of the dust grains, and spectrum of the heating radiation field. The nearto mid-infrared is dominated by the emissions by very small grains. Modeling the absorption of starlight by these particles is, however, comp...
متن کاملUsing Intel® Quartus® Prime Software to Maximize Performance in the Intel HyperFlexTM FPGA Architecture
The innovative Intel® HyperFlexTM FPGA Architecture features a “registers everywhere” design that includes ubiquitous retiming and pipelining registers, called Hyper-Registers. These Hyper-Registers are available in every routing wire on the device. Combined with the new Intel Quartus® Prime Hyper-Aware design flow, these Hyper-Registers allow designers to break the performance barrier, achievi...
متن کاملAccelerating Virtual Product Design with Quad-Core Intel® Xeon® Processor-Based Worksta- tions and ESI Group Software
Workstation supercomputers powered by the Quad-Core Intel® Xeon® processor 5300 series enable users of ESI Group simulation-based design and optimization solutions to dramatically enhance engineering productivity and deliver higher quality designs in less time, while improving the cost-effectiveness and flexibility of the high performance computing (HPC) infrastructure. Performance Brief Quad-C...
متن کاملAccelerating Virtual Product Design with Quad-Core Intel® Xeon® Processor-Based Worksta- tions and ESI Group Software
Workstation supercomputers powered by the Quad-Core Intel® Xeon® processor 5300 series enable users of ESI Group simulation-based design and optimization solutions to dramatically enhance engineering productivity and deliver higher quality designs in less time, while improving the cost-effectiveness and flexibility of the high performance computing (HPC) infrastructure. Performance Brief Quad-C...
متن کامل